HPV 52 L1 Nucleotide Sequence Alignment



52 L1 wt    (1) ATGTCCGTGTGGCGGCCTAGTGAGGCCACTGTGTACCTGCCTCCTGTACC
52 L1 R         ........C...A.A..ATCC..A..T.....C...T....A..A..T..

52 L1 wt   (51) TGTCTCTAAGGTTGTAAGCACTGATGAGTATGTGTCTCGCACAAGCATCT
52 L1 R         A..............CTCT..C..C..A..C..C..CA.A..CTC.....

52 L1 wt  (101) ATTATTATGCAGGCAGTTCTCGATTACTAACAGTAGGACATCCCTATTTT
52 L1 R         .C..C..C..T..TTCC...A....GT.G..T..C..T..C..A..C..C

52 L1 wt  (151) TCTATTAAAAACACCAGTAGTGGTAATGGTAAAAAAGTTTTAGTTCCCAA
52 L1 R         .....C..G......TCCTCC.....C.....G..G..C..G.....A..

52 L1 wt  (201) GGTGTCTGGCCTGCAATACAGGGTATTTAGAATTAAATTGCCGGACCCTA
52 L1 R         ...C.....TT..........A..C..C.....C..G.....A.....A.

52 L1 wt  (251) ATAAATTTGGTTTTCCAGATACATCTTTTTATAACCCAGAAACCCAAAGG
52 L1 R         .C..G..C.....C.....C..TAG...C..C...........T.....A

52 L1 wt  (301) TTGGTGTGGGCCTGTACAGGCTTGGAAATTGGTAGGGGACAGCCTTTAGG
52 L1 R         .....C.....T.....T..T........C.....A..T..A..A..G..

52 L1 wt  (351) TGTGGGTATTAGTGGGCATCCTTTATTAAACAAGTTTGATGATACTGAAA
52 L1 R         ...C.....CTC...T..C..A..G..G........C..C..C.......

52 L1 wt  (401) CCAGTAACAAATATGCTGGTAAACCTGGTATAGATAATAGGGAATGTTTA
52 L1 R         ..TC......G..C........G..A.....C.....C..A........G

52 L1 wt  (451) TCTATGGATTATAAGCAGACTCAGTTATGCATTTTAGGATGCAAACCTCC
52 L1 R         ........C..C.....A.....A..G..T..C..G..T..T..G..A..

52 L1 wt  (501) TATAGGTGAACATTGGGGTAAGGGAACCCCTTGTAATAATAATTCAGGAA
52 L1 R         A..C........C...........T..T..A.....C..C..C..T..T.

52 L1 wt  (551) ATCCTGGGGATTGTCCTCCCCTACAGCTCATTAACAGTGTAATACAGGAT
52 L1 R         .C..A..T..C.....A..AT.G..AT.G..C...TCC..C..C..A..C

52 L1 wt  (601) GGGGACATGGTAGATACAGGATTTGGTTGCATGGATTTTAATACCTTGCA
52 L1 R         ..T........C..C..T..T..C.....T.....C..C..C........



FIG.1A 
52 L1 wt  (651) AGCTAGTAAAAGTGATGTGCCCATTGATATATGTAGCAGTGTATGTAAGT
52 L1 R         ....TC...GTCC..C..C..A..C..C..C...TC.TC...C.......

52 L1 wt  (701) ATCCAGATTATTTGCAAATGGCTAGCGAGCCATATGGTGACAGTTTGTTC
52 L1 R         .C.....C..C............TCT..A.....C......TCC......

52 L1 wt  (751) TTTTTTCTTAGACGTGAGCAAATGTTTGTTAGACACTTTTTTAATAGGGC
52 L1 R         ..C..CT.G...A.A..A........C..C........C..C..C..A..

52 L1 wt  (801) CGGTACCTTAGGTGACCCTGTGCCAGGTGATTTATATATACAAGGGTCTA
52 L1 R         T........G........A..T........C..G..C..C.....T..C.

52 L1 wt  (851) ACTCTGGCAATACTGCCACTGTACAAAGCAGTGCTTTTTTTCCTACTCCT
52 L1 R         .......T..C.....T.....C...TC.TC......C..C..A.....A

52 L1 wt  (901) AGTGGTTCTATGGTAACCTCAGAATCCCAATTATTTAATAAACCGTACTG
52 L1 R         TC......C.....C.....C...........G..C..C..G..A.....

52 L1 wt  (951) GTTACAACGTGCGCAGGGCCACAATAATGGCATATGTTGGGGCAATCAGT
52 L1 R         ...G...A.A..T..A..T.....C..C..T..C........T..C..A.

52 L1 wt (1001) TGTTTGTCACAGTTGTGGATACCACTCGTAGCACTAACATGACTTTATGT
52 L1 R         ....C.....C..C..C..C..T...A.ATCT...........C..G...

52 L1 wt (1051) GCTGAGGTTAAAAAGGAAAGCACATATAAAAATGAAAATTTTAAGGAATA
52 L1 R         .....A..C..G......TC...C..C..G..C.....C..C........

52 L1 wt (1101) CCTTCGTCATGGCGAGGAATTTGATTTACAATTTATTTTTCAATTGTGCA
52 L1 R         .T.GA.A..C..T..A.....C..C..G.....C..C..C........T.

52 L1 wt (1151) AAATTACATTAACAGCTGATGTTATGACATACATTCATAAGATGGATGCC
52 L1 R         .G..C..C..G..C.....C..C.....T.....C..C........C..T

52 L1 wt (1201) ACTATTTTAGAGGACTGGCAATTTGGCCTTACCCCACCACCGTCTGCATC
52 L1 R         .....C..G..A...........C..TT.G..T........A..C..T..

52 L1 wt (1251) TTTGGAGGACACATACAGATTTGTCACTTCTACTGCTATAACTTGTCAAA
52 L1 R         C.....A.....T........C........C........C..C.......



FIG.1B
52 L1 wt (1301) AAAACACGCCACCTAAAGGAAAGGAAGATCCTTTAAAGGACTATATGTTT
52 L1 R         .G.....T.....A..G..T........C..A..G........C.....C

52 L1 wt (1351) TGGGAGGTGGATTTAAAAGAAAAGTTTTCTGCAGATTTAGATCAGTTTCC
52 L1 R         .....A..C..C..G..G........C.....T..C..G..C..A..C..

52 L1 wt (1401) TTTAGGTAGGAAGTTTTTGTTACAGGCAGGGCTACAGGCTAGGCCCAAAC
52 L1 R         A..G.....A.....C.....G..A..T..TT.G..A.....A..A..GT

52 L1 wt (1451) TAAAACGCCCTGCATCATCGGCCCCACGTACCTCCACAAAGAAGAAAAAG
52 L1 R         .G..GA.A..A..TAGC..T..T...A.A..T.....C........G...

52 L1 wt (1501) GTTAAAAGGTAA (SEQ ID NO: 3)
52 L1 R         ..C..G..A... (SEQ ID NO: 1)



FIG.1C 
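
The dot notation used in FIG. 1 can be illustrated with a short sketch. It is not part of the original figures; the function names `diff_row` and `expand_row` are hypothetical, and the two 50-nucleotide strings are copied from the first row of the wild-type sequence (FIG. 1) and the first row of the rebuilt sequence (FIG. 2). A dot marks a position where the rebuilt sequence matches the wild type, and a letter gives the substituted nucleotide.

```python
def diff_row(reference: str, variant: str) -> str:
    """Return the variant row with '.' at every position that matches the reference."""
    if len(reference) != len(variant):
        raise ValueError("rows must have equal length")
    return "".join("." if r == v else v for r, v in zip(reference, variant))


def expand_row(reference: str, dotted: str) -> str:
    """Inverse of diff_row: replace each '.' with the reference nucleotide."""
    if len(reference) != len(dotted):
        raise ValueError("rows must have equal length")
    return "".join(r if d == "." else d for r, d in zip(reference, dotted))


# Nucleotides 1-50, copied from the figures (52 L1 wt and 52 L1 R).
WT_1_50 = "ATGTCCGTGTGGCGGCCTAGTGAGGCCACTGTGTACCTGCCTCCTGTACC"
R_1_50 = "ATGTCCGTCTGGAGACCATCCGAAGCTACTGTCTACTTGCCACCAGTTCC"

if __name__ == "__main__":
    dotted = diff_row(WT_1_50, R_1_50)
    print(dotted)
    # Expanding the dotted row against the wild type recovers the rebuilt row.
    print(expand_row(WT_1_50, dotted) == R_1_50)
```

Applying `expand_row` row by row to the wild-type sequence is one way to cross-check the alignment against the full 52 L1 R sequence listed in FIG. 2.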
HPV 52 L1 R Nucleotide and Amino Acid Sequences 

MSVW RPS EAT VYLP PVP 
1 ATGTCCGTCT GGAGACCATC CGAAGCTACT GTCTACTTGC CACCAGTTCC 
TACAGGCAGA CCTCTGGTAG GCTTCGATGA CAGATGAACG GTGGTCAAGG 
GSK VVST DEY V SR TSIY 
51 AGTCTCTAAG GTTGTCTCTA CCGACGAATA CGTCTCCAGA ACCTCCATCT 
TCAGAGATTC CAACAGAGAT GGCTGCTTAT GCAGAGGTCT TGGAGGTAGA 
YYA GSS RLLT VGH PYF 
101 ACTACTACGC TGGTTCCTCT AGATTGTTGA CTGTCGGTCA CCCATACTTC 
TGATGATGCG ACCAAGGAGA TCTAACAACT GACAGCCAGT GGGTATGAAG 
SIKN TSS GNG KKVL VPK 
151 TCTATCAAGA ACACCTCCTC CGGTAACGGT AAGAAGGTCT TGGTTCCAAA 
AGATAGTTCT TGTGGAGGAG GCCATTGCCA TTCTTCCAGA ACCAAGGTTT 
VSG LQYR VFR I K L PDPN 
201 GGTCTCTGGT TTGCAATACA GAGTCTTCAG AATCAAGTTG CCAGACCCAA 
CCAGAGACCA AACGTTATGT CTCAGAAGTC TTAGTTCAAC GGTCTGGGTT 
KFG FPD TSFY NPE TQR 
251 ACAAGTTCGG TTTCCCAGAC ACTAGTTTCT ACAACCCAGA AACTCAAAGA 
TGTTCAAGCC AAAGGGTCTG TGATCAAAGA TGTTGGGTCT TTGAGTTTCT 
LVWA CTG LEI GRGQ PLG 
301 TTGGTCTGGG CTTGTACTGG TTTGGAAATC GGTAGAGGTC AACCATTGGG 
AACCAGACCC GAACATGACC AAACCTTTAG CCATCTCCAG TTGGTAACCC 
VGI SGHP LLN KFD DTET 
351 TGTCGGTATC TCTGGTCACC CATTGTTGAA CAAGTTCGAC GACACTGAAA 
ACAGCCATAG AGACCAGTGG GTAACAACTT GTTCAAGCTG CTGTGACTTT 
SNK YAG KPGI DNR ECL 
401 CCTCTAACAA GTACGCTGGT AAGCCAGGTA TCGATAACAG AGAATGTTTG 
GGAGATTGTT CATGCGACCA TTCGGTCCAT AGCTATTGTC TCTTACAAAC 
SMDY KQT QLC ILGC KPP 
451 TCTATGGACT ACAAGCAAAC TCAATTGTGT ATCTTGGGTT GTAAGCCACC 
AGATACCTGA TGTTCGTTTG AGTTAACACA TAGAACCCAA CATTCGGTGG 
IG E HWGK GTP CNN NSGN 
501 AATCGGTGAA CACTGGGGTA AGGGTACTCC ATGTAACAAC AACTCTGGTA 
TTAGCCACTT GTGACCCCAT TCCCATGAGG TACATTGTTG TTGAGACCAT 
PGD CPP LQLI NSV I Q D 
551 ACCCAGGTGA CTGTCCACCA TTGCAATTGA TCAACTCCGT CATCCAAGAC 
TGGGTCCACT GACAGGTGGT AACGTTAACT AGTTGAGGCA GTAGGTTCTG 
GDMV DTG FGC MDFN TLQ 
601 GGTGACATGG TCGACACTGG TTTCGGTTGT ATGGACTTCA ACACCTTGCA 
CCACTGTACC AGCTGTGACC AAAGCCAACA TACCTGAAGT TGTGGAACGT 

FIG.2A 
ASK SDVP IDI CSS VCKY 
651 AGCTTCTAAG TCCGACGTCC CAATCGACAT CTGTTCCTCT GTCTGTAAGT 
TCGAAGATTC AGGCTGCAGG GTTAGCTGTA GACAAGGAGA CAGACATTCA 
PDY LQM ASEP YGD SLF 
701 ACCCAGACTA CTTGCAAATG GCTTCTGAAC CATACGGTGA CTCCTTGTTC 
TGGGTCTGAT GAACGTTTAC CGAAGACTTG GTATGCCACT GAGGAACAAG 
FFLR REQ MFV RHFF N R A 
751 TTCTTCTTGA GAAGAGAACA AATGTTCGTC AGACACTTCT TCAACAGAGC 
AAGAAGAACT CTTCTCTTGT TTACAAGCAG TCTGTGAAGA AGTTGTCTCG 
GTL GDPV PGD LYI QGSN 
801 TGGTACCTTG GGTGACCCAG TTCCAGGTGA CTTGTACATC CAAGGTTCCA 
ACCATGGAAC CCACTGGGTC AAGGTCCACT GAACATGTAG GTTCCAAGGT 
SGN TAT VQSS AFF PTP 
851 ACTCTGGTAA CACTGCTACT GTCCAATCCT CTGCTTTCTT CCCAACTCCA 
TGAGACCATT GTGACGATGA CAGGTTAGGA GACGAAAGAA GGGTTGAGGT 
SGSM VTS ESQ LFNK PYW 
901 TCTGGTTCCA TGGTCACCTC CGAATCCCAA TTGTTCAACA AGCCATACTG 
AGACCAAGGT ACCAGTGGAG GCTTAGGGTT AACAAGTTGT TCGGTATGAC 
LQR AQGH NNG ICW GNQL 
951 GTTGCAAAGA GCTCAAGGTC ACAACAACGG TATCTGTTGG GGTAACCAAT 
CAACGTTTCT CGAGTTCCAG TGTTGTTGCC ATAGACAACC CCATTGGTTA 
FVT VVD TTRS TNM TLC 
1001 TGTTCGTCAC CGTCGTCGAC ACTACTAGAT CTACTAACAT GACCTTGTGT 
ACAAGCAGTG GCAGCAGCTG TGATGATCTA GATGATTGTA CTGGAACACA 
AEVK KES TYK NENF KEY 
1051 GCTGAAGTCA AGAAGGAATC CACCTACAAG AACGAAAACT TCAAGGAATA 
CGACTTCAGT TCTTCCTTAG GTGGATGTTC TTGCTTTTGA AGTTCCTTAT 
LRH GEEF DLQ FIF QLCK 
1101 CTTGAGACAC GGTGAAGAAT TCGACTTGCA ATTCATCTTC CAATTGTGTA 
GAACTCTGTG CCACTTCTTA AGCTGAACGT TAAGTAGAAG GTTAACACAT 
ITL TAD VMTY IH K MDA 
1151 AGATCACCTT GACCGCTGAC GTCATGACTT ACATCCACAA GATGGACGCT 
TCTAGTGGAA CTGGCGACTG CAGTACTGAA TGTAGGTGTT CTACCTGCGA 
TILE DWQ FGL TPPP SAS 
1201 ACTATCTTGG AAGACTGGCA ATTCGGTTTG ACTCCACCAC CATCCGCTTC 
TGATAGAACC TTCTGACCGT TAAGCCAAAC TGAGGTGGTG GTAGGCGAAG 
LED TYRF VTS T A I TCQK 
1251 CTTGGAAGAC ACTTACAGAT TCGTCACTTC CACTGCTATC ACCTGTCAAA 
GAACCTTCTG TGAATGTCTA AGCAGTGAAG GTGACGATAG TGGACAGTTT 



FIG.2B 
NTP PKG KEDP LKD YMF 
1301 AGAACACTCC ACCAAAGGGT AAGGAAGACC CATTGAAGGA CTACATGTTC 

TCTTGTGAGG TGGTTTCCCA TTCCTTCTGG GTAACTTCCT GATGTACAAG 

WEVD LKE KFS ADLD QFP 
1351 TGGGAAGTCG ACTTGAAGGA AAAGTTCTCT GCTGACTTGG ACCAATTCCC 

ACCCTTCAGC TGAACTTCCT TTTCAAGAGA CGACTGAACC TGGTTAAGGG 
LGR K FLL QAG LQA RPKL 
1401 ATTGGGTAGA AAGTTCTTGT TGCAAGCTGG TTTGCAAGCT AGACCAAAGT 

TAACCCATCT TTCAAGAACA ACGTTCGACC AAACGTTCGA TCTGGTTTCA 
KRP ASS APRT STK KKK 
1451 TGAAGAGACC AGCTAGCTCT GCTCCAAGAA CTTCCACCAA GAAGAAGAAG 

ACTTCTCTGG TCGATCGAGA CGAGGTTCTT GAAGGTGGTT CTTCTTCTTC 

V K R * (SEQ ID NO: 2) 
1501 GTCAAGAGAT AA (SEQ ID NO: 1) 

CAGTTCTCTA TT (SEQ ID NO: 7) 



FIG.2C 
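
The three-line layout of FIG. 2 (amino acids above the coding strand, with the complementary strand beneath) can be checked with a minimal sketch. It is not part of the original figures: the names `complement` and `translate` are hypothetical, and the sketch assumes the standard genetic code, written here as a 64-letter string indexed in T, C, A, G base order.

```python
# Standard genetic code (assumed), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Base-by-base complement, in the same left-to-right order as the figure."""
    return seq.translate(_COMPLEMENT)


def translate(seq: str) -> str:
    """Translate complete codons from position 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    first_row = "ATGTCCGTCTGGAGACCATCCGAAGCTACTGTCTACTTGCCACCAGTTCC"  # nt 1-50
    last_row = "GTCAAGAGATAA"  # nt 1501-1512
    print(translate(first_row))
    print(complement(first_row[:10]))
    print(translate(last_row))
```

Because the rows are 50 nucleotides long, codons straddle row boundaries; translating the concatenated rows rather than each row separately avoids frame errors.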
Expression of HPV 52 L1 wt and 52 L1 R Transcripts 




Full length L1 mRNA 



FIG. 3 
Transmission EM of VLPs Composed of HPV 52 L1 R Protein Molecules 




FIG. 5 



